fix FacDB FACTYPE bugs #2360

Merged
damonmcc merged 12 commits into main from dm-facdb-bugs on May 12, 2026

Conversation

@damonmcc
Member

@damonmcc damonmcc commented May 8, 2026

resolves #2359

all builds on this branch

changes

  • improve the s3_compare marimo notebook used to compare build files
  • general cleanup of FacDB build scripts
  • add minimal dbt project setup
  • add test for NULL FACTYPE values
  • add test for many-to-one relationship between FACTYPE and FACSUBGRP
  • fix failing tests by:
    • expanding lists of source data values mapped to FACTYPE values
    • improving mappings of FACTYPE and FACSUBGRP to avoid many-to-many relationships
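
The two new tests can be pictured with a minimal Python sketch. This is not the dbt test code from this PR; the lowercase column names `factype` and `facsubgrp`, the helper names, and the sample rows are all assumptions made up for illustration.

```python
from collections import defaultdict


def find_null_factypes(records):
    """Return records whose FACTYPE is missing (None or blank)."""
    return [r for r in records if not (r.get("factype") or "").strip()]


def find_ambiguous_factypes(records):
    """Return {factype: [subgroups]} for FACTYPE values tied to more than one FACSUBGRP.

    An empty result means the FACTYPE -> FACSUBGRP relationship is many-to-one.
    """
    subgroups = defaultdict(set)
    for r in records:
        factype = (r.get("factype") or "").strip()
        if factype:
            subgroups[factype].add(r.get("facsubgrp") or "")
    return {ft: sorted(sg) for ft, sg in subgroups.items() if len(sg) > 1}


# hypothetical rows, not real FacDB values
sample = [
    {"factype": "TYPE A", "facsubgrp": "SUBGROUP 1"},
    {"factype": "TYPE A", "facsubgrp": "SUBGROUP 2"},  # TYPE A now has two subgroups
    {"factype": "TYPE B", "facsubgrp": "SUBGROUP 1"},
    {"factype": None, "facsubgrp": "SUBGROUP 2"},  # NULL FACTYPE
]

null_rows = find_null_factypes(sample)
ambiguous = find_ambiguous_factypes(sample)
```

Both checks pass only when they return nothing, which matches the usual convention that a data test fails if it finds any offending rows.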

follow up work

as part of QA for the next FacDB release, these mapping changes will be reviewed more thoroughly than usual. since this PR is about fixing NULL FACTYPE values and FACTYPE/FACSUBGRP relationships for AE's dev work, the exact mappings matter less here than getting the tests to pass

validation/tests

new build tests passing here

comparing main and dev builds using the s3_compare marimo notebook

  • Screenshot 2026-05-12 at 10 58 39 AM

same source data

  • Screenshot 2026-05-12 at 10 58 59 AM

different counts of records when grouped by FACSUBGRP according to the qc_classification.csv output

for example, there are now 15 more IMMIGRANT SERVICES records

  • Screenshot 2026-05-12 at 11 03 10 AM
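
A rough sketch of that grouped-count comparison, for anyone who wants to reproduce it outside the notebook. This is not the s3_compare notebook code; it assumes one row per record with a `facsubgrp` column, which may not match the real layout of qc_classification.csv, and the CSV contents below are made up.

```python
import csv
import io
from collections import Counter


def count_by_subgroup(csv_text, column="facsubgrp"):
    """Count rows per subgroup in CSV text (column name is an assumption)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row[column] for row in reader)


def count_changes(main_counts, dev_counts):
    """Return {subgroup: dev count minus main count} for subgroups whose counts differ."""
    groups = sorted(set(main_counts) | set(dev_counts))
    return {
        g: dev_counts[g] - main_counts[g]
        for g in groups
        if dev_counts[g] != main_counts[g]
    }


# hypothetical build outputs, not real FacDB data
main_csv = "facsubgrp\nSUBGROUP 1\nSUBGROUP 2\n"
dev_csv = "facsubgrp\nSUBGROUP 1\nSUBGROUP 2\nSUBGROUP 2\n"

changes = count_changes(count_by_subgroup(main_csv), count_by_subgroup(dev_csv))
```

A positive value means the dev build has more records in that subgroup than main, as with the IMMIGRANT SERVICES example above.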

@damonmcc damonmcc changed the title from "start facdb dbt project" to "fix FacDB FACTYPE bugs" May 8, 2026
@codecov

codecov Bot commented May 8, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 71.76%. Comparing base (a736b94) to head (3f20e57).

@damonmcc damonmcc force-pushed the dm-facdb-bugs branch 11 times, most recently from 82ea154 to 419d6b0 May 12, 2026 14:48
@damonmcc damonmcc marked this pull request as ready for review May 12, 2026 14:48
Member Author

@damonmcc damonmcc May 12, 2026

new tests here. the null FACNAME test isn't related to this PR, but I want to revisit it later as part of general data quality improvements

@damonmcc damonmcc requested review from alexrichey and pratishta May 12, 2026 15:04
Contributor

@alexrichey alexrichey left a comment

Love it, esp the dbt'ization. I do think at some point we need to rip the cleaning logic out of facdb and into ingest, which would perhaps enable us to track changes to those datasets a little more cleanly. But that's a whole endeavor of its own.

@damonmcc damonmcc merged commit 85258bc into main May 12, 2026
24 checks passed
@damonmcc damonmcc deleted the dm-facdb-bugs branch May 14, 2026 10:05

Development

Successfully merging this pull request may close these issues.

FacDB FACTYPE bugs

2 participants